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OPTIMIZED T-DNA TRANSFER AND VECTORS THEREFOR 



Field of Invention 

The present invention relates to the field of molecular biology. More particularly, it 
5 describes methods for the incorporation of foreign DNA into the genome of eukaryotic 
cells such as plant cells. The specifics of the current invention lie in the design of the 
border repeats flanking Agrobacterially transferred T-DNA such that read-through 
through the left T-DNA border is significantly decreased and/or that integrated vector 
backbone DNA can easily be removed. Thus, the frequency of obtaining transgenic 
1 0 eukaryotic cells containing only the T-DNA is increased. 

Background to the invention 

Improved plant varieties have been obtained by 'classical' crossbreeding ever since 
man exchanged nomadic existence for permanent settlement. In the more recent 

1 5 history, scientists started to unravel the behavior of genetic material during crossings 
and plant breeders could, and still do benefit from the knowledge contained within the 
Mendelian laws predicting the distribution of a given genetic trait in the offspring of a 
crossing. With the advent of plant molecular biology plant breeders can, with an ever 
increasing precision, insert novel chimeric genes into the genome of a plant. A variety 

20 of techniques is nowadays available to mediate genetic transformation of plants 
including agrolistics, microinjection, electroporation, direct gene transfer and 
bombardment with DNA-coated particles. A preferred and widely used plant 
transformation system makes use of the soil bacterium Agrobacterium (Zupan and 
Zambryski 1995, Gelvin 1998a, Gheysen et al. 1998). Nowadays, Agrobacterium is not 

25 only used to transform plants but also to transform yeast, moulds and filamentous fungi 
(Bundock et at. 1995, de Groot et al. 1998, Gouka et al. 1999. W098/45455). It has 
furthermore been shown that components of the T-DNA production and transfer 
machinery of Agrobacterium are useful for import of DNA into nuclei of mammalian cells, 
opening perspectives for use of these components in gene therapy (Ziemienowicz et al. 

30 1 999). Agrobacterium transfers into a eukaryotic cell nucleus any DNA located on the T- 
DNA. This T-DNA is part of the wild-type Ti- (in case of Agrobacterium tumefaciens) or 
Ri-plasmid (in case of A. rhizogenes). Wild-type T-DNA carries the genes causing, after 
integration in the plant genome, crown gall tumors or the hairy root syndrome in case of 
infection with A. tumefaciens ox A. rhizogenes, respectively. Also located on the wild-type 

35 Ti- or Ri-plasmids are vir genes (virulence genes) which are activated by plant phenolic 
compounds. Products of the vir genes are responsible for the transfer of the T-DNA into 
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the eukaryotic genome. For transformation purposes, the T-DNA is disarmed (i.e. all 
disease-causing genes are removed) and vir genes are supplied either in trans on a 
helper plasmid (the T-DNA encompassing heterologous gene(s) is then located on a 
second binary plant transformation vector) or in cis in case of a co-integrate plant 
5 transformation vector. The heterologous genes of interest are cloned in between the two 
T-DNA 22 bp (in case of octopine Ti plasmids) or 25 bp (in case of nopaline Ti plasmids) 
imperfect border core sequences constituting to the right border (RB) and the left border 
(LB), that are the only in cis elements necessary to direct T-DNA processing. The border 
core sequences in RB and LB are organized as imperfect repeats. 
10 The VirD1 and VirD2 proteins produce a single-stranded nick between the third and 
fourth base in the bottom strand of each border repeat (Yanofsky et al. 1 986). Increased 
levels of VirD1 and VirD2 enhance the production of T-DNA complexes inside 
Agrobacterium and result in an increased plant transformation efficiency (Wang et al. 
1990). 

1 5 For many years, it was believed that only the DNA between the repeats, the T-DNA, and 
not the vector DNA external to the T-DNA was transferred to the eukaryotic cell. 
However, recent and more detailed characterization of the DNA inserts in transgenic 
plants demonstrates that also vector backbone sequences integrate very frequently into 
the plant genome (Martineau et al. 1994, Ramanathan and Veluthambi 1995, Cluster et 

20 al. 1996, van der Graaff et al. 1996, Kononov et al. 1997, Wenck et al. 1997, Wolters et 
al. 1998). 

The authors of the present invention have previously found that the frequency of 
integration of vector sequences is not influenced by the plant species or the 
transformation method used. This is consistent with the view that transfer of vector 

25 backbone sequences is the consequence of read-through past the LB, a process that is 
occurring within the Agrobacterium cells and is most probably determined by factors 
within these cells. It should, however, be noticed that others reported vector backbone 
integration in 33% of Arabidopsis transformants obtained via root transformation and in 
up to 62% of transformants obtained via vacuum infiltration (Wenck et al. 1997). This 

30 implies that the transformation method used could be another factor influencing the 
frequency of vector backbone integration. Integration of vector backbone sequences has 
been reported to occur in many plant species including Petunia (Virts and Gelvin 1985, 
Cluster et al. 1996), Arabidopsis (Van der Graaff et al. 1996, Wenck et al. 1997), tobacco 
(Ramanathan and Veluthambi 1995, Kononov et al. 1997, Wenck et al. 1997), and potato 

35 (Wolters et al. 1998). Vector backbone integration is apparently independent of the type 
of Agrobacterium strain used for plant transformation (Kononov et al. 1997). 
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The inventors have previously analyzed different series of transformants for the presence 
of vector backbone sequences by using specific PCR reactions and DNA gel blot 
analysis. Three different transformation methods in two different plant species were 
evaluated, namely Arabidopsis thaliana root and leaf transformation and Nicotiana 

5 tabacum protoplast and leaf transformation. Finally, the influence of the replicon type, the 
ColE1 and pVS1 replicons, was evaluated. The results showed that neither the plant 
species nor the explant type used for transformation, the replicon type or the selection 
have a major influence on the frequency with which integration of vector sequences 
occurred. In the past, it was postulated that this transfer of vector DNA that does not 

10 belong to the T-DNA could be the result of read-through at the 4eft border, which would 
prevent the normal termination of the T-DNA transfer. Alternatively, DNA transfer could 
start at the left border and proceed towards the right border (Ramanathan and 
Veluthambi, 1995; van der Graaff et al., 1996). In the transgenic plants described above, 
however, it was observed that many contain vector backbone sequences linked to the 

1 5 left border as well as vector junctions with the right T-DNA border. DNA gel blots indicate 
that in most of these plants the complete vector sequence is integrated. Therefore, it was 
postulated that integration in the plant genome of complete vector backbone sequences 
can be the result of a conjugative transfer initiated at the right border and subsequent 
continued copying at the left and right borders, called read-through. This model implies 

20 that the left border is not frequently recognized as an initiation site for DNA transfer and 
that the right border is not efficiently recognized as a termination site for DNA transfer. 
These observations comply with the results of previous work showing that the right 
border region is intrinsically more active than the left border region in promoting T-DNA 
transformation (Jen and Chilton 1986a,b, Caplan et al. 1985). From all available data, it 

25 can be concluded that T-strand formation starts much more frequently at the right border 
than at the left border region. 

In the future, it will be of utmost importance to prevent or to cure vector backbone 
integration as a consequence of >*grobacfer/um-mediated transformation. Firstly, 
regulatory authorities are demanding that transgenic plants to be released on the 

30 common marketplace are free of vector backbone sequences. Such backbone 
sequences can carry bacterial origins of replications, bacterial antibiotic resistance 
genes and possibly a number of other (foreign) genes. The same rigorous concerns will 
also be expressed by consumers who are becoming increasingly aware of such 
potential hazards associated with plant biotechnology. Secondly, also from a scientific 

35 point of view it is desirable not to have vector backbone integration in the genome of 
plants. Such sequences can influence transgene expression (Iglesias et al. 1997, 
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Matzke and Matzke 1998, Jakowitsch et al. 1999). Vector backbone integration is also 
likely to interfere with T-DNA tagging experiments. Tags can be considerably longer than 
expected (Martineau et al. 1994) and vector backbone integration might also be the 
explanation for the fact that in a large percentage of T-DNA tagged Arabidopsis plants 
5 the T-DNA is not co-segregating with a mutant phenotype (Errampali et al. 1991, 
Feldmann 1991 , Koncz et al. 1992, Van Lijsebettens et al. 1991). 

Although the phenomenon of occasional vector backbone integration was already 
encountered as early as 1982 by Ooms et al., it lasted till 1995 for a first solution to be 
suggested by Ramanathan and Veluthambi (1995): "...novel binary vectors with 'stop- 
10 transfer" signals adjacent to the left border may be constructed". Since then, one has 
tried to understand the mechanism of vector backbone integration. A possible reason 
was discussed by Wenck et al. (1997): "...inefficient nicking may be due to low 
amounts of virulence proteins, primarily VirD2". Only very recently, however, methods 
were disclosed that prevent read-through at the T-DNA borders (WO99/01563) and by 
15 Hanson et al. (1999). These methods are based on including sequences outside the 
borders. These sequences are either genes coding for toxic compounds or sequences 
capable of interacting with DNA-binding proteins or sequences that are enriched in 
G+C nucleotides. A major drawback of the methods described in WO99/01563 and 
Hanson et al. (1999) is that the regeneration of plant transformants carrying more than 
20 the T-DNA region in their genome is prevented. It is conceivable that such methods 
impair the overall transformation efficiency, i.e. lower numbers of transformants willbe 
obtained from a given transformation experiment. It was indeed reported by Hanson et 
al. (1999) that tobacco transformation efficiencies drop by as much as 30%. 
Nevertheless, Hanson et al. (1999) described their approach as a useful tool lor the 
25 elimination of non-T-DNA sequences from transgenic individuals'. 

The current invention describes a solution to the technical problem of undesired vector 
backbone integration and provides advantages over existing methods. 

Summary of the Invention 

30 The invention describes transformation vectors comprising a T-DNA with flanking T- 
DNA borders. The T-DNA vectors are characterized in that they are modified such that 
they allow genetic transformation of a eukaryotic cell only with the T-DNA and not with 
vector backbone sequences. This is accomplished either by preventing transfer of 
vector backbone sequences to the genome of a eukaryotic cell or by curing of vector 

35 backbone sequences transferred to the genome of a eukaryotic cell. Said modified T- 
DNA vectors allow for an efficient processing of the left border by the nicking complex 
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involving at least VirD1 and VirD2 or allow for excision of transferred vector backbone 
sequences. 

A first embodiment of the invention comprises optimized T-DNA vectors including a 
modification of the T-DNA right border comprising a single right border core sequence 
5 flanked by a right border outer region and/or a modification, including multiplication, of 
the left T-DNA border designed as: 

a) a single left border core sequence flanked by a natural left border outer region and 
an intra-T-DNA left border proximal region with a length of 10 to 100 bp, preferably 
20-100 bp, and which is enriched in the number of A- and T-residues, the 

10 percentage of AT-residues preferably being 60 to 85%, more preferably being 64 to 

80%, most preferably being 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78 or 79 
%, provided that said intra-T-DNA left border proximal region is not the 
corresponding natural octopine type left border inner region, or, 

b) a single left border core sequence flanked only by a natural left border inner region; 
15 or, 

c) a single left border core sequence; or 

d) a tandem repeat of several left border core sequences flanked only by a natural left 
border outer region, with the tandem repeat preferably containing 2-3 left border 
core sequences but not excluding a higher copy number of the left border core 

20 imperfect repeat and with said repeated left border core sequences in said tandem 

being separated by a sequence of at least 10-20 bp optionally carrying stop codons 
in the three reading frames and in both directions; or 

e) an integral nopaline-type left border region adjacent to and downstream or 
upstream of the integral octopine-type left border region. 

25 Another embodiment of the invention comprises optimized T-DNA transformation 
vectors with additional DNA sequences outside the T-DNA border core repeats 
enabling post-transformational removal of integrated vector backbone sequences. Said 
additional DNA sequences modify, including multiply, the T-DNA border regions or 
parts thereof and comprise: 
30 f) recombination sites organized as repeats downstream of the left border core 
sequence and upstream of the right border core sequence; or 
g) said DNA sequences of (f) further modified by adding a second copy of a left border 
region positioned upstream of and preferably adjacent to the single right border 
outer region and said recombination site upstream of the core sequence of said 
35 second left border region; or 
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h) said recombination sites of (f) with a recombinase gene, downstream of said 
recombination site downstream of the left border core sequence, and preferably, 
when present, adjacent to and downstream of the left border outer region; or 

i) a DNA sequence located downstream of the left border region, said DNA sequence 
5 comprising a recombinase gene flanked by repeats of recombination sites as 

defined in (h) and further comprising a second copy of a left border region 
downstream of said recombinase; or 
j) the DNA sequence of (i) with additional recombination sites organized as repeats 
downstream of the second left border core sequence and upstream of the single 

1 0 right border core sequence; 

In any of said modifications (f), (g), (h), (i) or (j), said recombination sites are located 
adjacent to and downstream and/or upstream of the left- and/or right border core 
sequences or are separated from the left- and/or right border core sequences by a 
sequence of at least 10-20 bp in length optionally carrying stop codons in the three 

1 5 reading frames and in both directions. 

In any of said modifications (f) f (g) f (h), (i) or (j), said recombination sites are either site- 
specific recombination sites arranged as direct repeats or transposon border 
sequences arranged as inverted repeats and said recombinase gene is either a site- 
specific recombinase gene or a transposase gene, respectively. 

20 Another embodiment of the invention includes transformation vectors in which any of 
said modifications as defined in (a) to (j), is applied, or are applied in combination. 
These transformation vectors comprise vectors used in Agrobacterium-mediated 
transformation including octopine-type binary transformation vectors, nopaline-type 
binary transformation vectors, co-integrate type transformation vectors, super-binary 

25 type transformation vectors, Ri-derived transformation vectors as well as T-DNA 
carrying vectors used in agrolistic transformation or gene therapy. 
The current invention comprises a method for obtaining transgenic eukaryotic cells 
transformed only with the T-DNA by preventing transfer of vector backbone sequences 
using said optimized transformation vectors containing any of said modifications (a) to 

30 G). 

Further comprised in the current invention is a method for obtaining transgenic 
eukaryotic cells transformed only with the T-DNA by enabling curing of said 
transformed cells containing vector backbone sequences using said optimized 
transformation vectors containing said DNA sequence of (f) or (g) in combination with 
35 the supply of a site-specific recombinase or transposase, or of said DNA sequences 
(h), (i), or (j) eventually in combination with the supply of a site-specific recombinase or 
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a transposase. With curing is meant the removal of transformation vector backbone 
sequences possibly originating from said vector without abolishing the T-DNA 
transformation event. 

Also part of the invention is a method for obtaining transgenic plants, yeasts, moulds or 
5 filamentous fungi which do not contain vector backbone sequences by transforming 
plants, yeasts, moulds or filamentous fungi with a transformation vector modified 
according to the invention and/or implying a method of the invention. 
The invention further comprises a method for increasing the production by 
Agrobacterium of the nicking complex at least involving VirD1 and VirD2. Said nicking 

1 0 complex components are encoded by the v/rD-operon (octopine-type or nopaline-type) 
of which at least one extra copy is integrated in chromosomal and/or 
extrachromosomal DNA entities contained and maintained within an Agrobacterium 
strain. To decrease T-DNA left border read-through, said method can be applied alone 
or in conjunction with transformation vectors containing modified T-DNA borders. 

15 Further comprised in the current invention are any combinations of methods and/or T- 
DNA vector modifications of the invention to prevent or to cure integration of vector 
backbone sequences with any other methods and/or T-DNA vector modifications 
applied to prevent or to cure integration of vector backbone sequences. 
Hosts containing said transformation vectors modified according to the present 

20 invention, like bacteria, preferably Agrobacterium tumefaciens also constitute to the 
invention. 

Also comprising the invention are any combinations of the site-specific recombination 
system or transposase-mediated recombination system of the current invention to cure 
integration of vector backbone sequences with the same or any other site-specific 
25 recombination system or transposase-mediated recombination system used for any 
other purpose. 

Also comprising within the invention are methods for agrolistic-based transformation of 
a eukaryotic cell using a T-DNA vector modified according to the alterations as defined 
in the vectors as mentioned above. 
30 Also comprising within the invention are methods for gene therapy using a T-DNA 
vector modified according to the alterations as defined in the vectors as mentioned 
above. 

Transgenic plant cells or plants obtainable by an Agrobacterium-mediated 
transformation method as defined above, part of such a plant or progeny thereof also 
35 constitute the present invention. 
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Transgenic yeasts, moulds or filamentous fungi obtainable by an Agrobacterium- 
mediated transformation method as defined above also constitute the present 
invention. 

With the foregoing and other objects, advantages and features of the invention that will 
5 become hereinafter apparent, the nature of the invention may be more clearly 
understood by reference to the following detailed description of the preferred 
embodiments of the invention and to the appended claims. 

Detailed Description of the Invention 

10 Agrobacterium-n\ed\a\ed transformation or agrolistic transformation of plants, yeast, 
moulds or filamentous fungi is based on the transfer of part of the transformation vector 
sequences, called the T-DNA, to the nucleus and on integration of said T-DNA in the 
genome of said eukaryote. 

With 'Agrobacterium 1 is meant a member of the Agrobacteriaceae. more preferably 

1 5 Agrobacterium or Rhizobacterium and most preferably Agrobacterium tumefaciens. 

With T-DNA' , or transferred DNA, is meant that part of the transformation vector 
flanked by T-DNA borders which is, after activation of the Agrobacterium vir genes, 
nicked at the T-DNA borders and is transferred as a single stranded DNA to the 
nucleus of an eukaryotic cell. T-DNA strand production is the result of a DNA-processing 

20 event initiated by a site-specific nick in a double-stranded molecule. A family of 
interrelated DNA-processing reactions exists in different prokaryotic systems. In each 
case, the substrate is a supercoiled circular DNA molecule and the nicking protein 
recognizes a short sequence, cleaves at a conserved site (conserved nick site positions 
and consensus region located 3* of the nick), and attaches covalently to the 5* end of the 

25 nicked strand. DNA-processing systems that belong to this family include, besides the 
formation of T-DNA in and transfer of the T-DNA into the eukaryotic cell by 
Agrobacterium, (1 ) the initiation of conjugative DNA replication and subsequent transfer 
of plasmids of Gram-negative bacteria, (2) the initiation of plus-strand synthesis during 
rolling circle replication of <>X 174- related phage, and (3) the initiation of rolling circle 

30 replication in certain plasmids of Gram-positive bacteria (Waters and Guiney 1993). The 
DNA processing event is catalyzed by an oligoprotein relaxosome complex. One of the 
proteins in this complex, the relaxase, is the key enzyme catalyzing the site- and strand- 
specific DNA cleavage (Pansegrau and Lanka 1996). In the case of conjugative transfer 
of plasmids the relaxase covalently binds to the 5' terminus of the DNA to be transferred 

35 and is not able to produce a second nick necessary for transfer of a single copy of the 
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DNA. It is thought that at least a relaxase dimer is required for single copy DNA transfer 
(Pansegrau and Lanka 1996). 

In the case of Agrobacterium, a minimum of two proteins are required for nicking and 
subsequent T-DNA transfer. These relaxosome proteins are VirD1 and VirD2. VirD1 is a 
5 type I topoisomerase without sequence specificity (Ghai and Das 1989). The VirD2 
protein is, also on the basis of sequence homology, the relaxase of Agrobacterium and 
acts as the nicking endonuclease (Pansegrau et al. 1994). During the nicking process, 
VirD2 gets covalently attached to both 5* ends of the processed T-DNA at the right 
border and to the rest of the T-DNA plasmid at the left border (Durrenberger et al. 1989, 

10 Young and Nester 1988). Both T-DNA left and right borders are recognized by VirD1 and 
VirD2. Strong interactions between VirD1 and VirD2 and amongst VirD2 proteins have 
been demonstrated and it was suggested that T-DNA strand displacement (by DNA 
replication) is terminated by producing a second nick depending on VirD2-VirD2 
interaction. This implies that separate relaxosome complexes including at least VirD1 

1 5 and VirD2 might be involved in processing each of the two T-DNA borders (Relic et al. 
1998). 

With T-DNA borders', T-DNA border region', or 'border region' are meant either right 
T-DNA border (RB) also referred to as 'right border' or left T-DNA border (LB) also 
referred to as 'left border'. Such a border comprises a core sequence flanked by a 

20 border inner region as part of the T-DNA flanking the border and/or a border outer 
region as part of the vector backbone flanking the border. The core sequences 
comprise 22 bp in case of octopine-type vectors and 25 bp in case of nopaline-type 
vectors. The core sequences in the right border region and left border region form 
imperfect repeats. Border core sequences are indispensable for recognition and 

25 processing by the Agrobacterium nicking complex consisting of at least VirD1 and 
VirD2. Core sequences flanking a T-DNA are sufficient to promote transfer of said T- 
DNA. However, efficiency of transformation using transformation vectors carrying said 
T-DNA solely flanked by said core sequences is low. Border inner and outer regions 
are known to modulate efficiency of T-DNA transfer (Wang et al. 1987). One element 

30 enhancing T-DNA transfer has been characterized and resides in the right border outer 
region and is called overdrive (Peralta et al. 1986, van Haaren et al. 1987). Core 
sequences are as follows (inn: part of T-DNA sequence; out: part of vector backbone 
sequence; core sequences indicated in bold and underlined capitals; Gielen et al. 
1984, 1999): 

35 -octopine-type RB: inn-tgatqctaact GGCAGGATATATACCGTTGTAA Tttqaqctcqt-out 
(SEQ ID NO 1) 
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-octopine-type LB: Qut-gnqgragcaac GGCAGGATATATTCAATTGTAA AtQQCttcatQ-inn 
(SEQ ID NO 2) 

-nopaline-type RB: inn>tatcagtQtt TGACAGGATATATTGGCGGGTAAAC ctaaqaqaa-out 
(SEQ ID NO 3) 

5 -nopaline-type LB: out-aactQQctQQ TGGCAGGATATATTGTGGTGTAAAC aaattgacg- 
inn (SEQ ID NO 4) 

With Integral border region 1 is meant a naturally occurring T-DNA border comprising 
the border core sequence and both the border inner and outer regions. 
With T-DNA transformation vector' or T-DNA vector' is meant any vector 
10 encompassing a T-DNA sequence flanked by a right and left T-DNA border consisting 
of at least the right and left border core sequences, respectively, and used for 
transformation of any eukaryotic cell. 

With Vector backbone sequence 1 or 'vector backbone sequences' is meant all DNA of 
a T-DNA containing vector that lies outside of the T-DNA borders and, more 

15 specifically, outside the nicking sites of the border core imperfect repeats. With Vector' 
is meant a transformation vector or T-DNA vector stably maintained in a bacterial 
culture, e.g. an Escherichia coli culture or an Agrobacterium culture. 
The current invention includes constructs of optimized T-DNA vectors such that vector 
backbone integration in the genome of a eukaryotic cell is minimized or absent or such 

20 that curing of vector backbone sequences integrated in a eukaryotic cell is possible. 
Said vectors, any derivative thereof or any vector utilizing any of the modifications or 
any combination of modifications of the present invention can then be used as starting 
material for cloning of the DNA sequences of interest between the two border repeats 
and for subsequent applications such as transformation of e.g. a crop plant, a yeast or 

25 fungus and such as gene therapy. 

With 'optimized T-DNA vector' is meant a T-DNA vector designed either to decrease or 
abolish transfer of vector backbone sequences to the genome of a eukaryotic cell or to 
allow curing of vector backbone sequences transferred to the genome of a eukaryotic 
cell. 

30 In analyses performed in the frame of the current invention (see Examples 1-3), the 
frequency of vector backbone transfer was compared in transgenic plants that were 
transformed with T-DNA vectors with border repeats in the natural octopine sequence 
context, and with T-DNA vectors without the inner border region of the border repeats. 
Substantially more vector backbone integration was found in the series of transformed 

35 plants in which T-DNA vectors without inner border repeat sequences were employed 
than in the series of transformed plants in which complete inner and outer border regions 
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were used. It was further observed that many transgenic plants contain vector 
backbone sequences linked to the left T-DNA border as well as vector junctions with 
the right T-DNA border (see Example 2). DNA gel blot analyses indicate that in most of 
these plants the complete vector sequence is integrated (see Example 3). Moreover, 
5 the frequency of integration of complete vector backbone sequences was not 
influenced by the presence or absence of the border inner region. These data suggest 
that the right border inner region most probably does not contain determinants 
important for efficient nicking at the left T-DNA border. This conclusion is further 
substantiated by earlier results obtained by Shaw et al. (1984) indicating that deletion 
10 of the RB inner region is not influencing efficiency of plant transformation. Determinants 
for efficient nicking at the left T-DNA border are thus most likely to reside within the LB 
itself. It is assumed that integration in the plant genome of complete vector backbone 
sequences is the result of a conjugative transfer initiated at the right border and 
subsequent continued copying at the left and right borders, called read-through. Vector 
15 backbone transfer can also be the consequence of recognition of the LB as a starting 
point for DNA transfer and continued copying at the RB up to the LB (van der Graaff et 
al. 1996). It is therefore important to identify LB elements involved in the efficient 
nicking at the LB core repeat. Thus, e.g. partial or complete deletion of the left border 
region inner and/or outer regions could put the left border core repeat in a context 
20 enhancing its affinity for and recognition by VirD1 and VirD2. A striking observation with 
respect to this comes from analysis of left (and right) border sequences of the 
transformation vectors used in experiments described in Examples 2 and 3. In Figure 
1, the percentages of A- and T-residues per 100 bp stretch around (and not including) 
the LB and RB core repeats is drawn for both transformation vectors with and without 
25 the natural border inner region context. The remarkable observation consists of the fact 
that the percentage of AT-residues of the LB proximal 100 bp of the natural inner 
region (K T-DNA vector in Figure 1) is considerably higher than the percentage of AT- 
residues of the LB proximal 100 bp of the vector in which the natural inner region is 
deleted (Hsb T-DNA vector in Figure 1). i.e. 64% compared to 56%, respectively. As 
30 mentioned supra, LB read-through and partial vector backbone transfer occurs much 
more frequently when plants are transformed with Hsb T-DNA vectors when compared 
to plant transformation with the K T-DNA vector. Mechanistically, it is possible that this 
lower percentage of AT-residues considerably attenuates the nicking activity of the LB- 
bound relaxosome including at least VirD1 and VirD2. Concurrently, the DNA 
35 replication machinery effecting the T-DNA strand displacement would, at relatively high 
frequencies, be able to displace the relaxosome from the LB before the nick has been 



□Mcnmn- --wn 



WO 01/44482 PCT/EP00/12683 

12 

realized. An increased frequency of read-through at the LB would be the consequence 
of such a process. Thus, the percentage of AT-residues of the 10-100 bp (or preferably 
20-100 bp) proximal to the LB and part of the T-DNA is a candidate element of the LB 
determining the efficiency of nicking at the LB core repeat. For nicking efficiency to be 
5 sufficiently high, the percentage of AT-residues of said LB proximal region is to be 
sufficiently high, i.e. at least about 60-85%, more preferably at least about 64-80%. 
Although the proximal 100 bp outside of the T-DNA RB core repeat contains a similar 
percentage of AT-residues as the proximal 100 bp inside of the T-DNA LB core repeat, 
i.e. 58% compared to 56%, respectively (see Figure 1), correct and highly efficient 

10 processing at the RB is not problematic. This is most likely due to the positive influence 
of the overdrive sequence present in the RB outer region (Peralta et al. 1986, van 
Haaren et al. 1987). It is not to be excluded that proteins auxiliary to the relaxosome 
complex bind to the overdrive sequence and enhance the process of RB nicking. This 
would again resemble the situation of conjugative plasmid transfer between bacteria 

15 where such auxiliary proteins have been identified (Waters and Guiney 1993). It is also 
not to be excluded that auxiliary proteins, at present not yet identified, are involved in 
efficient processing of the LB. 

The first modified T-DNA vector construct for preventing vector backbone transfer 
described supra includes an intra-T-DNA LB proximal region enriched in A- and T- 

20 residues. Such an approach will, however, not be practicable for all T-DNA constructs. 
Thus, in Figure 2, a number of additional optimized T-DNA vector constructs are 
schematically drawn. Results of plant transformation with the first of the constructs of 
Figure 2 are included in the present invention (see Examples 1-3). The other constructs 
underlie several approaches to obtain optimized T-DNA vectors. The optimizations 

25 reside in modifications of the left border design aimed at increasing the efficiency of LB 
nicking and resulting in vector backbone integration in the genome of a eukaryotic cell 
that is highly minimized or absent. In said optimized T-DNA vector constructs, the RB 
region is constant, i.e. it comprises the right border outer region but the right border 
inner region is deleted. Said right border region contains the 25 bp RB core sequence 

30 and the upstream 146 bp of the right border outer region derived from pTiAch5 (Gielen 
et al. 1984). This outer region contains the overdrive sequence. It is clear that right 
border regions in their natural context, i.e. comprising the right border inner region can 
be used as well in the current invention. Right borders from which the border inner 
regions have been removed have, however, the advantage that transfer to the plant of 

35 "foreign" DNA without a clear function and as part of the T-DNA is further limited. 
Furthermore, it has been discussed supra that the absence of the RB inner region very 
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unlikely influences efficiency of the LB processing. The LB regions in said exemplary 
optimized T-DNA vectors are generally derived from octopine-type Ti plasmids and all 
contain a natural left border core sequence. Nopaline-type T-DNA borders are usually 
derived from Ti-plasmids such as pTiC58 (Gielen et al. 1999). More details about 
5 exemplary border regions used in exemplary optimized T-DNA vectors of the invention 
are described in Figure 2 and Example 4. 

A first embodiment of the invention comprises optimized T-DNA vectors as exemplified 
in Figure 2 and including a modification of the T-DNA right border comprising a single 
right border core sequence flanked by a right border outer region and/or a modification, 
1 0 including multiplication, of the left T-DNA border designed as: 

a) a single left border core sequence flanked by a natural left border outer region and 
an intra-T-DNA left border proximal region with a length of 10 to 100 bp, preferably 
20-100 bp, and which is enriched in the number of A- and T-residues, the 
percentage of AT-residues preferably being 60 to 85%, more preferably being 64 to 

15 80%, most preferably being 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78 or 79 

%, provided that said intra-T-DNA left border proximal region is not the 
corresponding natural octopine type left border inner region; or, 

b) a single left border core sequence flanked only by a natural left border inner region; 
or 

20 c) a single left border core sequence; or 

d) a tandem repeat of several left border core sequences flanked only by a natural left 
border outer region, with the tandem repeat preferably containing 2-3 left border 
core sequences but not excluding a higher copy number of the left border core 
imperfect repeat and with said repeated left border core sequences in said tandem 

25 being separate by a sequence of at least 10-20 bp optionally carrying stop codons 

in the three reading frames and in both directions; or 

e) an integral nopaline-type left border region adjacent to and downstream or 
upstream of the integral octopine-type left border region. 

Incorporation of modified left border regions displaying efficient nicking at the LB core 
30 sequence in commonly used transformation vectors will prevent LB read-through. An 
additional advantage of the tandem arrangement of the LB core sequences or of 
integral left borders lies in the fact that occasional read-through or DNA transfer 
starting at the first copy of said LB core sequence or integral LB region will be 
terminated at an adjacent copy of said LB core sequence or integral LB region in said 
35 tandem. In both cases comprised within the current invention, the frequency of transfer 
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of vector backbone sequences and integration of said sequences in a eukaryotic 
genome is decreased. 

With •downstream* and 'upstream 1 are meant any sequences located 5' and 3\ 
respectively, of any other sequence. As a reference, the T-DNA strand is taken which 
5 is starting at its 5' end at the RB and terminating at its 3' end at the LB. 

Further optimization of T-DNA transformation vectors is envisaged once it is known 
whether the left border inner and/or outer regions is/are necessary to avoid vector 
backbone integration. Several deletion derivatives of said inner and/or outer regions 
can be made to determine which specific sequence(s) in said inner and/or outer border 

1 0 regions is/are important in determining termination of T-DNA transfer. Once this/these 
sequences is/are known, it/they can be included in more copies around the LB repeat 
resulting in a more efficient nicking. Such left borders capable of further optimizing T- 
DNA transformation vectors and said vectors containing said further modified left 
borders are also subject of the present invention. 

15 The current invention also embodies incorporation of any of said modified left borders 
in any T-DNA vector comprising binary transformation vectors, super-binary 
transformation vectors, co-integrate transformation vectors, Ri-derived transformation 
vectors as well as in T-DNA carrying vectors used in agrolistic transformation or gene 
therapy. 

20 With 'binary transformation vector 1 is meant a T-DNA transformation vector comprising: 

a) a T-DNA region comprising at least one gene of interest and/or at least one 
selectable marker active in the eukaryotic cell to be transformed; and 

b) a vector backbone region comprising at least origins of replication active in E. coli 
and Agrobacterium and markers for selection in E. coli and Agrobacterium. 

25 Alternatively, replication of the binary transformation vector in Agrobacterium is 
dependent on the presence of a separate helper plasmid. The binary vector pGreen 
and the helper plasmid pSoup form an example of such a system as described in e.g. 
Hellens et al. (2000), Plant Mol. Biol. 42, 819-832, or as available on the internet site 
http://www.pgreen.ac.uk. 

30 The T-DNA borders of a binary transformation vector can be derived from octopine- 
type or nopaline-type Ti plasmids or from both. The T-DNA of a binary vector is only 
transferred to a eukaryotic cell in conjunction with a helper plasmid. 
With 'helper plasmid* is meant a plasmid that is stably maintained in Agrobacterium and 
is at least carrying the set of vir genes necessary for enabling transfer of the T-DNA. 

35 Said set of vir genes can be derived from either octopine-type or nopaline-type Ti 
plasmids or from both. 
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With 'super-binary transformation vector' is meant a binary transformation vector 
additionally carrying in the vector backbone region a vir region of the Ti plasmid 
pTiBo542 of the super-virulent A tumefaciens strain A281 (EP0604662, EP0687730). 
Super-binary transformation vectors are used in conjunction with a helper plasmid. 
5 With 4 co-integrate transformation vector' is meant a T-DNA vector at least comprising: 

a) a T-DNA region comprising at least one gene of interest and/or at least one 
selectable marker active in plants; and 

b) a vector backbone region comprising at least origins of replication active in 
Escherichia coli and Agrobacterium, and markers for selection in E. coli and 

1 0 Agrobacterium. 

The T-DNA borders and said set of vir genes of a said T-DNA vector can be derived 
from either octopine-type or nopaline-type Ti plasmids or from both. 
With 'Ri-derived plant transformation vector' is meant a binary transformation vector in 
which the T-DNA borders are derived from a Ti plasmid and said binary transformation 

1 5 vector being used in conjunction with a 'helper' Ri-plasmid carrying the necessary set 
of vir genes. 

With 'agrolistics', 'agrolistic transformation' or 'agrolistic transfer' is meant here a 
transformation method combining features of /\grobacfer/L/m-mediated transformation 
and of biolistic DNA delivery. As such, a T-DNA containing target plasmid is co- 

20 delivered with DNA/RNA enabling in planta production of VirD1 and VirD2 with or 
without VirE2 (Hansen and Chilton 1996; Hansen et al. 1997; WO97/12046). 
Two possible mechanisms leading to transfer and integration of vector backbone 
sequences in an eukaryotic genome are recognized. In a first mechanism, T-DNA 
transfer legitimately starts at the right T-DNA border but does not stop at the left T-DNA 

25 border and is followed by continued copying of vector backbone sequences. This 
mechanism is known as left border read-through. In a second mechanism, DNA 
transfer is illegitimately initiated at the left T-DNA border with continued copying till the 
left T-DNA border is reached again. Irrespective of which mechanism prevails, analysis 
of transgenic plants indicates that many of them have the complete vector backbone 

30 sequences integrated into their genome. The present invention further describes an 
approach to cure transformed cells containing vector backbone sequences by 
recombination involving a recombinase and recombination sites. 

With 'curing' is meant here the removal of transformation vector backbone sequences 
without abolishing the T-DNA integration event. Another tool for the elimination of non- 
35 T-DNA sequences from transgenic individuals has been described by Hanson et al. 
(1 999). This tool is, however, different and not related to the curing method described 
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in the present invention. As discussed supra, elimination of non-T-DNA sequences 
according to the methods of Hanson et al. (1 999) or WO99/01 563 has the drawback of 
decreasing the transformation efficiency (Hanson et al. 1999). 

Thus, in another embodiment of the invention, curing of transformed cells is obtained 
5 by a recombination event as the method for removing the illegitimately integrated 
vector backbone sequences. As a result, transformed cells are obtained which retain 
the T-DNA in their genome. Contrary to the methods of Hanson et al. (1999) or 
WO99/01563, the curing method of the current invention rescues T-DNA containing 
transgenic cells which previously carried illegitimately integrated vector backbone 
1 0 sequences. Transformation efficiencies are thus hardly affected. 

With 'recombination event' is meant either a site-specific recombination event or a 
recombination event effected by transposon 'jumping'. 

With 'recombinase' is meant either a site-specific recombinase or a transposase. 

With 'recombination site' is meant either site-specific recombination sites or transposon 

1 5 border sequences. 

With 'site specific recombination event' is meant an event catalyzed by a system 
generally consisting of three elements: a pair of DNA sequences (the site-specific 
recombination sequences or sites) and a specific enzyme (the site-specific 
recombinase). The site-specific recombinase catalyzes a recombination reaction only 

20 between two site-specific recombination sequences depending on the orientation of the 
site-specific recombination sequences. Sequences intervening between two site- 
specific recombination sites will be inverted in the presence of the site-specific 
recombinase when the site-specific recombination sequences are oriented in opposite 
directions relative to one another (i.e. inverted repeats). If the site-specific 

25 recombination sequences are oriented in the same direction relative to one another 
(i.e. direct repeats), then any intervening sequences will be deleted upon interaction 
with the site-specific recombinase. Thus, if the site-specific recombination sequences 
are present as direct repeats at both ends of vector backbone sequences integrated 
into a eukaryotic genome, such integration of said sequences can subsequently be 

30 removed by interaction of the site-specific recombination sequences with the 
corresponding site specific recombinase. 

A number of different site specific recombinase systems can be used including but not 
limited to the Cre/lox system of bacteriophage P1, the FLP/FRT system of yeast, the 
Gin recombinase of phage Mu, the Pin recombinase of E. cort, the PinB, PinD and PinF 
35 from Shigella, and the R/RS system of Zygosaccharomyces rouxii. Recombinases 
generally are integrases, resolvases or flippases. Also dual-specific recombinases can 
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be used in conjunction with direct or indirect repeats of two different site-specific 
recombination sites corresponding to the dual-specific recombinase (WO99/25840). 
The preferred site-specific recombinase systems are the bacteriophage P1 Cre/lox and 
the yeast FLP/FRT and the Z. rouxii R/RS systems. In these systems a recombinase 
5 (Cre. FLP or R, respectively) interact specifically with its respective site-specific 
recombination sequence (lox, FRT or RS respectively) to invert or excise the 
intervening sequences. The site-specific recombination sequences for each of these 
two systems are relatively short (34 bp for lox and 47 bp for FRT). Some of these 
systems have already been used with high efficiency in plants such as tobacco (Dale et 

10 al. 1990, Onouchi et al. 1991 , Sugita et al. 2000) and Arabidopsis (Osborne et al. 1995, 
Onouchi et al. 1995). Site-specific recombination systems have many applications in 
plant molecular biology including methods for control of homologous recombination 
(e.g. US5527695), for targeted insertion, gene stacking, etc. (W099/25821) and for 
resolution of complex T-DNA integration patterns or for excision of a selectable marker 

15 (WO99/23202). In these applications the site-specific recombination sites are typically 
part of the DNA integrated in the eukaryotic genome as T-DNA or via homologous 
recombination, and thus are not present in the vector backbone sequence. 
Although the site-specific recombination sequences must be linked to the ends of the 
DNA to be excised or to be inverted, the gene encoding the site specific recombinase 

20 may be located elsewhere. For example, the recombinase gene could already be 
present in the eukaryote's DNA or could be supplied by a later introduced DNA 
fragment either introduced directly into cells, through crossing or through cross- 
pollination. Alternatively, a substantially purified recombinase protein could be 
introduced directly into the eukaryotic cell, e.g. by micro-injection or particle 

25 bombardment. Typically, the site-specific recombinase coding region will be operably 
linked to regulatory sequences enabling expression of the site-specific recombinase in 
the eukaryotic cell. 

As part of the current invention, site-specific recombination sites are introduced in the 
T-DNA vector such that they lie outside the T-DNA but allow transferred vector 

30 backbone sequences or parts thereof possibly originating from said T-DNA vector to be 
excised post-transformationally by the action of a site-specific recombinase. 
With 'recombination event effected by transposon jumping 1 or 'transposase-mediated 
recombination' is meant a recombination event catalyzed by a system consisting of 
three elements: a pair of DNA sequences (the transposon border sequences) and a 

35 specific enzyme (the transposase). The transposase catalyzes a recombination 



BNSDOCID: <WO 01'*482A2 l_> 



WO 01/44482 



18 



PCT/EP00/12683 



reaction only between two transposon border sequences which are arranged as 
inverted repeats. 

A number of different transposon/transposase systems can be used including but not 
limited to the Ds/Ac system, the Spm system and the Mu system. These systems 
5 originate from corn but it has been shown that at least the Ds/Ac and the Spm system 
also function in other plants (Fedoroff et al. 1993, Schtappi et al. 1993, Van Sluys et al. 
1987). Preferred are the Ds- and the Spm-type transposons which are delineated by 1 1 
bp- and 1 3 bp- border sequences, respectively. 

Although the transposon border sequences must be linked to the ends of the DNA to 
10 be excised, the gene encoding the transposase may be located elsewhere. For 
example, the recombinase gene could already be present in the eukaryote's DNA or 
could be supplied by a later introduced DNA fragment either introduced directly into 
cells, through crossing or through cross-pollination. Alternatively, a substantially 
purified transposase protein could be introduced directly into cells, e.g. by 
1 5 microinjection or by particle bombardment. 

As part of the current invention, transposon border sequences are introduced in the T- 
DNA vector such that they lie outside the T-DNA and transform the vector backbone or 
part thereof into a transposon-like entity that can move by the action of a transposase. 
As transposons, and thus in the current invention the vector backbone or part thereof, 
20 often reintegrate at another locus of the host's genome, segregation of the progeny of 
the hosts in which the transposase was allowed to act might be necessary to separate 
transformed hosts containing only the T-DNA and transformed hosts containing only 
the vector backbone or part thereof. 

Another embodiment of the invention comprises optimized T-DNA transformation 
25 vectors with additional DNA sequences outside the T-DNA border core repeats as 
exemplified in Figure 3 and enabling post-transformational removal of integrated vector 
backbone sequences. Said additional DNA sequences modify, including multiply, the T- 
DNA border regions or parts thereof and comprise: 

f) recombination sites organized as repeats downstream of the left border core 
30 sequence and upstream of the right border core sequence; or 

g) said DNA sequences of (f) further modified by adding a second copy of a left border 
region positioned upstream of and preferably adjacent to the single right border 
outer region and said recombination site upstream of the core sequence of said 
second left border region; or 
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h) said recombination sites of' (f) with a recombinase gene, downstream of said 
recombination site downstream of the left border core sequence, and preferably, 
when present, adjacent to and downstream of the left border outer region; or 

i) a DNA sequence located downstream of the left border region, said DNA sequence 
5 comprising a recombinase gene flanked by repeats of recombination sites as 

defined in (h) and further comprising a second copy of a left border region 
downstream of said recombinase; or 
j) the DNA sequence of (i) with additional recombination sites organized as repeats 
downstream of the second left border core sequence and upstream of the single 

1 0 right border core sequence; 

In any of said modifications (f), (g), (h), (i) or (j). said recombination sites are located 
adjacent to and downstream and/or upstream of the left- and/or right border core 
sequences or are separated from the left- and/or right border core sequences by a 
sequence of at least 10-20 bp in length optionally carrying stop codons in the three 

1 5 reading frames and in both directions. 

In any of said modifications (f), (g), (h), (i) or (j). said recombination sites are either site- 
specific recombination sites arranged as direct repeats or transposon border 
sequences arranged as inverted repeats and said recombinase gene is either a site- 
specific recombinase gene or a transposase gene, respectively. 

20 It is obvious that said modifications (f) or (g) in said T-DNA transformation vectors are 
to be considered as simple modifications, including multiplication, of the T-DNA border 
regions or parts thereof. 

Introduction of additional DNA sequences downstream of the T-DNA left border core 
repeat to prevent left border read-through (WO99/01563) or to eliminate non-T-DNA 

25 sequences (Hanson et al. 1999) have been described and these sequences prevent 
the development of transformants having integrated vector backbone sequences. In 
addition to said modifications (f), said modification (g) of the T-DNA vector has the 
advantage that read-through at the first LB core sequence can be halted at the second 
LB core sequence, thus preventing duplication of the T-DNA fragment. Inserted vector 

30 backbone sequences are subsequently removed by the action of an appropriate 
recombinase. In addition to said modifications (f), said modification (h) introduces in a 
T-DNA vector a recombinase gene sequence downstream of the left border core 
sequence to allow for resolving the integrated vector backbone sequence possibly 
originating from said T-DNA vector at the recombination sites introduced in said 

35 modified T-DNA borders. Thus, and contrary to WO99/01563, transformants having 
integrated vector backbone sequences can be rescued after curing, i.e. after removal 
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of the vector backbone sequences flanked by the recombination sites introduced in 
said modified T-DNA borders. Concurrently, the recombinase gene is excised. 
Said modifications (i) further change the design of the left border region, i.e. an 
additional copy of a left border region is added. Both copies are, however, not arranged 
5 in tandem as described supra in the first approach to prevent vector backbone 
integration, but are separated by the recombinase gene and the border core 
sequences are occurring in their broader border context. Said modifications (i) are thus 
to be considered as a modification of the T-DNA vector construct described supra in 
which the single LB region contains a tandem arrangement of the LB core sequence. 
1 0 The presence of a second copy of a left T-DNA border region has the advantage that 
illegitimate DNA transfer starting at or read-through at the first left border core 
sequence can be halted at the second left border core sequence. Thus, only part of the 
vector backbone sequence is transferred to the eukaryotic nucleus. As this vector 
backbone region is flanked by recombination sites, it can easily be removed by means 
15 of action of a recombinase. Illegitimate DNA transfer starting at or read-through at the 
second copy of the left border core sequence can, however, still occur. 
Therefore, said modification (j) further adds a pair of recombination sites downstream 
of the second copy of the left border core sequence and upstream of the single right 
border region. Integrated vector backbone sequences can again be easily removed by 
20 action of a recombinase at the recombination sites. In the case of illegitimate DNA 
transfer starting at the second copy of the left border core sequence, the recombinase 
will not be supplied by the T-DNA transformation vector backbone. It must thus be 
supplied from elsewhere, e.g. through sexual crossing with a plant already containing 
the recombinase gene in it's genome. 
25 Any of said modifications (f), (g), (h), (i) or G) is also applicable to cure transgenic cells 
from vector backbone sequences integrated into the genome of a eukaryote 
independently of the T-DNA, i.e. not physically linked to the T-DNA. 
Further comprised in the current invention is a method using said T-DNA 
transformation vectors containing said DNA sequences of (f) or (g) in combination with 
30 the supply of a recombinase, or using said T-DNA transformation vectors containing 
said DNA sequences (h), (i) or (j) for curing of transformed cells containing backbone 
sequences of said vector backbone sequences optionally in combination with the 
supply of a recombinase. 

Site-specific recombinases or transposases introduced into the host's genome for 
35 enabling recombination can subsequently be removed by segregation of the progeny of 
the transformed host in which recombination was allowed to occur. Segregation of said 
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progeny also is a way to separate transformed hosts containing only the T-DNA and 
transformed hosts containing only the vector backbone or part thereof. 
The current invention further embodies incorporation of any of said additional DNA 
sequences in any T-DNA vector comprising binary transformation vectors, super-binary 
5 transformation vectors, co-integrate transformation vectors, Ri-derived transformation 
vectors as well as in T-DNA carrying vectors used in agrolistic transformation or gene 
therapy. 

In one embodiment of the current invention, the recombinase gene is supplied to the 
transgenic plants containing a vector backbone sequence flanked by recombination 

10 sites by sexual crossing with a plant containing the recombinase gene in it's genome. 
Said recombinase can be operably linked to either a constitutive or an inducible 
promoter. The recombinase gene can alternatively be under the control of single 
subunit bacteriophage RNA polymerase specific promoters, such as a T7 or a T3 
specific promoter, provided that the host cells also comprise the corresponding RNA 

15 polymerase in an active form. Yet another alternative method for expression of the 
recombinase consists of operably linking the recombinase open reading frame with an 
upstream activating sequence fired by a transactivating transcription factor such as 
GAL4 or derivatives (US5801027, WO97/30164, WO98/59062) or the Lac repressor 
(EP0823480), provided that the host cell is supplied in an appropriate way with the 

20 transcription factor. 

In another embodiment of the invention, the recombinase gene is supplied on the 
transformation vector backbone and the promoter of said recombinase gene preferably 
is an inducible promoter. Such promoters are known to those familiar with the art and 
include e.g. a heat-shock responsive promoter, a glucocorticoid-inducible promoter or a 

25 promoter inducible by another chemical compound (e.g. as disclosed in EP0332104 
and WO90/08826). 

In a preferred embodiment, expression of the recombinase gene in bacterial hosts is 
prohibited by including (an) intron sequence(s) in the coding region of said 
recombinase gene. 

30 It is known in the art that increased levels of VirD1 and VirD2 in Agrobacterium lead to 
higher levels of plant transformation (Wang et al. 1990). VirD1 and VirD2 have also 
been integrated in a method for improved integration of exogenous DNA delivered to 
eukaryotic cells by means of transforming said eukaryotic cell with chimeric virDl 
and/or virD2 genes (WO97/12046). It has, however, so far not been demonstrated that 

35 increased production of VirDl and/or VirD2 can prevent integration of vector backbone 
sequences. 
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Thus, in another embodiment of the invention, vector backbone sequence integration is 
prevented by enhancing the efficiency of the nicking at the left border core sequence of 
a T-DNA vector by increasing the production of the T-DNA nicking complex involving at 
least endonucleases VirD1 and VirD2. 
5 In a preferred embodiment of the invention, additional copies of the octopine-type virD 
locus are integrated either into the genome of Agrobacterium and/or into a helper 
plasmid and/or into a binary transformation vector and/or into a superbinary 
transformation vector and/or into a co-integrate transformation vector and/or a Ri- 
derived plant transformation vector. In another embodiment of the invention, additional 
10 copies of the nopaline-type virD locus can be supplied as said. 

The current invention thus presents three methods either to prevent or to cure 
integration of transformation vector backbone sequences and using: 

1) any of the T-DNA vectors (a) to (e) modified at the left T-DNA border region such 
that read-through is prevented or halted or DNA-transfer starting at this border is 

1 5 prevented or halted; or 

2) any of the T-DNA vectors (f) to (j) which enable resolving of transformation vector 
backbone sequences integrated in the genome of transgenic cells by means of 
recombination; or 

3) at least one extra copy of the virD locus in Agrobacterium to increase the levels of 
20 VirD1 and VirD2 as minimal constituents of the T-DNA border nicking complex. 

It is clear to those familiar with the art that any of these methods or parts thereof can be 
used either alone, in combinations of two, or as a combination of all three of them. 
Exemplary T-DNA vector designs reflecting some of such combinations are given in 
Fig. 3. 

25 Preferred combinations of the inventive methods (1) or (2) with inventive method (3) 
consist of a transformation vector carrying a modified LB region containing a tandem 
arrangement of the LB core sequence or multiple copies of the LB region and the 
enhanced production of VirD1 and VirD2 by Agrobacterium, It can indeed be expected 
that the tandem of LB core sequences or the multiple copies of the LB region titrate the 

30 VirD1 and VirD2 proteins available in an Agrobacterium strain normally used for 
transformation. As a result, these proteins would no longer be available for establishing 
the nick at the single RB core sequence. 

Another embodiment of the invention thus comprises the use of any of said methods 
and/or T-DNA vector modifications to prevent or to cure transformation vector 
35 backbone integration either separately or in any combination. Preferred combinations 
are those in which enhanced levels of VirD proteins produced by Agrobacterium are 
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allowed to act on T-DNA vectors harboring multiple copies of the LB core sequence or 
multiple copies of the LB regions. 

It is also clear that any of said methods and/or T-DNA vector modifications to prevent 
or to cure vector backbone integration can be combined with any other method 
5 preventing or curing vector backbone integration. Such methods and T-DNA vector 
modifications include the addition of genes or DNA sequences downstream of the left 
border, e.g. as disclosed in WO99/01563. Such genes/DNA sequences include genes 
encoding cytotoxic compounds, antisense housekeeping genes, sequences prohibiting 
unwinding of the DNA behind the left border region, e.g. sequences with high GC 
1 0 content or vir box sequences interacting with DNA-binding proteins. 

Another embodiment of the invention thus comprises the use of any of the methods 
and/or T-DNA vector modifications of the current invention to prevent or to cure 
integration of transformation vector backbone sequences in combination with any other 
method and/or T-DNA transformation vector modification to prevent or to cure 
1 5 transformation vector backbone integration. 

It will furthermore be clear to the one skilled in the art that any of the methods and/or T- 
DNA vector modifications of the current invention to cure integration of transformation 
vector backbone sequences can be combined with any other method and/or T-DNA 
vector modification using or incorporating any recombination systems, e.g. to excise 
20 selectable markers active in plants. Said curing method can indeed be combined with 
e.g. excision of a selectable marker by flanking the exemplary selectable marker either 
with the same or with different recombination sites as those flanking the vector 
backbone sequence. In said curing method, the vector backbone sequence can also be 
flanked by two different site-specific recombination sites and curing can be performed 
25 by a dual-specific recombinase with specificities corresponding to the site-specific 
recombination sites used. 

A further embodiment of the current invention thus comprises the use of the methods 
and/or T-DNA vector modifications of the current invention to cure integration of 
transformation vector backbone sequences in combination with methods and/or T-DNA 
30 vector modifications using or incorporating any recombination system for purposes 
other than curing of vector backbone sequence integration. 

Another embodiment of the current invention comprises curing of integrated 
transformation vector backbone sequences implying modification of the T-DNA borders 
by adding any pair of mutually different site-specific recombination sites and used in 
35 conjunction with an at least dual-specific recombinase with specificities at least 
corresponding to the site-specific recombination sites used. 
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In another embodiment of the invention, any of said T-DNA vector constructs according 
to the invention is mobilized to an Agrobacterium strain. The resulting strains also 
constitute to the invention. 

In yet another embodiment of the invention, any of said T-DNA vectors according to the 
5 invention is used in an Agrobacter/um-mediated or agrolistic transformation, such 
transformation methods being known to the one skilled in the art. Said T-DNA vectors 
according to the invention can be used to transform several plant tissues comprising 
roots, protoplasts, leaves etc. of different monocotyledonous and dicotyledonous plant 
species such as, but not limited to, Arabidopsis, tobacco, petunia, tomato, potato, 
10 beans, rice and wheat. More in general, it can be any monocotyledonous or 
dicotyledonous plant, preferbly belonging to a plant species of interest in agriculture, 
wood culture, horticulture or to a plant species applied in the production of 
pharmaceuticals, biochemicals, antibodies or perfumes. Such plants include crop 
plants, root plants, oil-producing plants, wood-producing plants, agricultured plants, 
15 fruit-producing plants, fodder or forage legumes, companian plants or horticultured 
plants. Such plants further include apricot, artichoke, asparagus, apple, banana, barley, 
broccoli, Brussels sprouts, cabbage, canola, carrot, cassava, cauliflower, celery, 
cherry, chicory, collard greens, cotton, Douglas fir, fir (Abies and Picea species), flax, 
garlic, grapes, kale, lentil, maize, oak, oats, oilseed rape, okra, onion, pear, pepper, 
20 poplar, rye, sorghum, soybean, squash, sugar beet, sugar cane, sunflower. 

Said T-DNA vectors according to the invention can also be used in combination with 
the flower dip plant transformation method (Clough and Bent 1 998) or with the in planta 
apical shoot transformation procedure (W099/14348). Said T-DNA vectors according 
to the invention can also be used to transform other eukaryotic cells including yeast, 
25 moulds and filamentous fungi and their conidia, hyphae or protoplasts derived thereof. 
Agrobacterium-med\a\ed transformation of said other eukaryotic cells is known to those 
skilled in the art (e.g. W098/45455). A key characteristic of the eukaryotic cells 
transformed via Agrobacterium-rned\a\e6 transfer or agrolistic transfer of any of the T- 
DNA transformation vectors of the invention is a significantly decreased frequency of 
30 integrated vector backbone sequences without greatly affecting the frequency of 
integration of T-DNA sequences. 

A preferred embodiment of the invention comprises transgenic plant cells obtained by 
any method of >4probacter/um-mediated transfer or agrolistic transfer of any of the T- 
DNA transformation vectors of the invention. Also comprised are the transgenic plants 
35 obtained or regenerated by any method after any method of /\grotoacfer/um-mediated 
transfer or agrolistic transfer of any of the T-DNA transformation vectors of the 
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invention. Such plants are characterized in that they contain T-DNA sequences, but not 
vector backbone sequences, in their genome. Further comprised are the offspring of 
said transgenic plants as well as cells, protoplasts, calli, tissues, organs, seed, fruit, 
pollen, egg cells, zygotes, zygotic or somatic embyros derived thereof or derived from 
5 said transgenic plant cells. 

The potential for using T-DNA carrying vectors according to the present invention in 
gene therapy can be explained as follows. It has been described that in vitro 
reconstituted complexes consisting of VirD2-ssDNA-VirE2 are able to transfer the 
ssDNA intactly into mammalian nuclei (Ziemienowicz et al. 1999). 
10 Vir D2 protects ssDNA against exonucleolytic degradation because it is covalently 
linked to the 5' end of the ssDNA (Durrenberger et al. 1989, Young and Nester 1988) 
and, by means of two nuclear localization sequences (NLSs), targets the ssDNA to the 
plant cell nucleus (Narasimhulu et al. 1996, Rossi et al. 1993, Shurvinton et al. 1992). 
VirD2 furthermore contains an 'omega 1 domain important for efficient integration of the 
15 T-DNA in the host genome (Narasimhulu et al. 1996, Mysore et al. 1998, Tinland et al. 
1995). The NLSs of VirD2 have been shown to be active in animal cells including 
human HeLa cells, Drosophila embryos and Xenopus oocytes (Guralnick et al. 1996, 
Ziemienowicz et al. 1999). VirD1 remains localized in the cytoplasm of mammalian 
cells but can be imported into the nucleus via a piggy-back mechanism involving VirD2 
20 (Ziemienowicz et al. 1999). 

VirE2 protects ssDNA against (endo)nucleolytic degradation and preserves integrity of 
the T-DNA (Gelvin 1998b, Rossi et al. 1996). The VirE2 protein is furthermore actively 
involved in targeting the T-DNA to the plant nucleus (Gelvin 1998b). The NLSs of VirE2 
may be plant specific but repositioning of a single amino acid within the VirE2 NLSs 
25 targets these modified proteins to the nuclei of animal cells (Guralnick et al. 1996). 
VirE2 proteins interact amongst themselves but interaction of VirE1 with VirE2 is much 
stronger and VirE1 inhibits VirE2 self-interaction. In Agrobactehum cells, VirE1 
prevents VirE2 from aggregating. VirE1 thus seems to function as a molecular 
chaperone of VirE2 (Deng et al. 1999). 
30 The characteristics of these Vir proteins make them ideal candidates for their 
application in gene therapy experiments as indicated by Ziemienowicz et al. (1999). 
One of the main problems in gene therapy indeed consists of the difficulty of DNA- 
delivery across the intracellular barriers including nucleolytic degradation and nuclear 
uptake. 

35 Viral vectors are one of the major vehicles used by scientists in gene therapy to get a 
DNA sequence expressed in the proper host. Retroviral vectors (including HIV and 
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MMLV) are employed in 63% of the gene therapy protocols approved by the 
Recombinant DNA Advisory Committee (a division of the NIH) whereas adenoviral 
vectors are used in 1 6% of such protocols. Other viral vectors include those based on 
AAV, HSV and Vaccinia. Viral vectors form an area of continual new development in 
5 gene therapy. 

It can be envisaged that the genes for the Agrobacterium proteins VirD1 , VirD2 and 
VirE2 (and eventually VirE1) are incorporated in a viral vector in such a way (i.e. 
including adapted codon usage and appropriate regulatory sequences) that they can 
be expressed in animal cells, preferably transiently, as preferably without integration of 
1 0 said v/r-genes in the host's genome. If a T-DNA containing the gene of interest for gene 
therapy purposes, is present in the same viral vector or is co-delivered on a separate 
viral vector or another type of viral vector, said T-DNA could be then efficiently 
transferred to the genome in nuclei of animal cells. A strict requirement for approval of 
such a gene therapy strategy would, however, be that only the T-DNA, and not any 
15 other external DNA is transferred to the nucleus. Any modification presented in the 
current invention to prevent or to cure transfer of DNA sequences not belonging to the 
T-DNA would thus be of great value for increasing the acceptability of gene therapy 
strategies that include the use of Agrobacterium proteins for providing and transferring 
the DNA of interest. Therefore, T-DNA carrying vectors modified according to any of 
20 the modifications or any combination of modifications of the current invention and being 
used for gene therapy purposes are included in the present invention. 
It is furthermore clear to the skilled artisans that at least the T-DNA vectors according 
to the current invention and comprising modified LBs resulting in highly efficient nicking 
at this LB can be utilized as a source of e.g. VirD2 - ss-T-DNA - VirE2 complexes that 
25 can be used in other gene therapy techniques including microinjection, electroporation, 
carrier mediated delivery and ballistic DNA injection. Said complexes could be enriched 
from e.g. an acetosyringone-induced Agrobacterium culture or a culture of E. coli 
expressing at least the VirD1, VirD2 and VirE2 proteins by means of e.g. affinity 
chromatography using an antibody recognizing the VirD2 protein. 
30 The invention, now being generally described, may be more clearly understood by 
reference to the following examples, which are included merely for purposes of 
illustration of certain aspects and embodiments of the present invention and are not 
intended to limit the invention. The contents of all references referred to in this text are 
hereby incorporated by reference. 
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Description of Figures 

Figure 1. Schematic representation of the percentage of A- and T-residues (indicated 
above and below the horizontal line indicating the vector) per 100 bp blocks around 
5 (and not including) left and right border core imperfect repeats of the K and Hsb T-DNA 
vectors described and used in Examples 1-3. 

Figure 2. Schematic representation of T-DNA constructs designed to asses the 
efficiency of nicking at the modified left border in conjunction with a right border lacking 
10 the border inner region. Gus: p-glucuronidase coding sequence; nptll: neomycin 
phosphotransferase II coding sequence; 3' ocs: 3' terminator of the octopine synthase 
gene; Pnos: promoter of the nopaline synthase gene; 3' nos: 3' terminator of the 
nopaline synthase gene; P35S: constitutive 35S RNA promoter of the cauliflower 
mosaic virus. 

15 

Figure 3. Schematic representation of T-DNA constructs designed to allow curing of 
vector backbone sequences integrated in the genome of a eukaryote. 
LB: left border; RB: right border. 

20 Figure 4. Schematic representation of exemplary T-DNA constructs featuring 
combinations of modifications according to the current invention. LB: left border; RB: 
right border; modLB: modified left border (see Figure 2 for possible modifications). 

Figure 5. DNA gel blot analysis. 

25 (A) Schematic representation of the digest and the probe used in the DNA gel blot 
analysis (not drawn to scale). Each plant genomic DNA sample and the plasmids used 
as positive controls were digested with Kpnl and Sac/, both cutting in the vector 
backbone outside the LB and RB. As probe the vector sequence of the K plasmid, 
shown as a hatched bar, was used after restriction with Kpnl and SacL 

30 (B) DNA gel blot analysis. The T-DNA vectors of which at least 1000 bp vector 
backbone was found via PCR analysis of the transgenic plants are indicated at the top 
of each lane. A fragment of 8476 bp (arrow 1) is observed when the entire vector 
backbone sequence of the K plasmid (K1 and K2 carry a different structural gene in 
their T-DNA but are furthermore identical) is integrated, whereas fragments of 7066 bp 

35 (arrow 2) are detected when the entire vector backbone sequence of the Hsb plasmid 
is integrated. The K and Hsb T-DNA plasmids are described in Example 1 . 
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Genomic DNA samples in lanes 1, 2 and 5 are derived from plants from experiments 
other than those described. 

Lane C: genomic DNA of untransformed A. thaliana. 

Plasmid lanes: contain pure DNA of the indicated plasmids digested with Kpnl and 
5 Sacl. 
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Examples 

V 

Example 1 

Transformation of A thaliana (L.) Heynh and Nicotiana tabacum (L.) 

5 Plasrnids pK2L610 ('K* T-DNA vector; De Buck et ai. 1998), pSingle gus ('KsbV T-DNA 
vector; see below) and pHSB610 ('Hsb' T-DNA vector; De Buck et al. 1999) were used 
for plant transformation. The vector pSingle gus ('KsbV) is similar to the vector 
pHSB610 ('Hsb') but the Pnos-hpt-3'nos fragment of pHSB610 ( 4 Hsb') is exchanged for 
a Pnos-nptll-3'nos cassette in pSingle gus ('Ksb1'). As a selectable marker for plant 

10 transformation, the K and Ksb1 vectors contain the neomycin phosphotransferase II 
gene (nptll) and the Hsb vector contains the hygromycin phosphotransferase gene 
(hpt). A. thaliana was co-transformed with plasrnids K and Hsb according to the 
>4grobacter/um-mediated Arabidopsis root transformation method (De Buck et al. 1999, 
Valvekens et al. 1988). Tobacco (A/, tabacum) was transformed with plasmid Ksb 

1 5 according to the >\grobacfer/um-mediated leaf disc transformation method (Horsch et 
al. 1985). Transgenic plants were regenerated on media containing the appropriate 
plant growth hormones and the appropriate selective agent. In series 1,18 transgenic 
A. thaliana plants co-transformed with the K and Hsb T-DNA vectors were obtained. In 
series 2, 36 N. tabacum plants transformed with the Ksb T-DNA vector were obtained. 

20 In an additional series, 26 N. tabacum plants transformed with the Ksb T-DNA vector 
were obtained. 

Example 2 

Integration of vector backbone sequences assessed by PCR 
25 For PCR analysis, DNA was isolated from leaf material of N. tabacum as described by 
Jones et al. (1985) and from A. thaliana according to De Neve et al. (1997). To screen 
the transgenic plants on the integration of vector sequences, different PCR reactions 
were performed. DNA (100 ng) was incubated with 500 ng of each primer in 1x Taq 
polymerase incubation buffer (Roche Diagnostics, Brussels, Belgium). Two and a half 
30 units of Taq polymerase were added to a final volume of 100 pL. Samples were heated 
to 94°C for 5 min before PCR. Denaturation was at 94°C for 1 min. Annealing occurred 
during 2 min at 57°C, and the extension reaction was at 72°C for 5 min, while 30 cycli 
were performed. The primer combinations were chosen so that the presence of at least 
100 bp ('RB100* and 'LB100* in Tables 1 and 2) or at least 1000 bp ('LB1000' and 
35 'RB1000' in Tables 1 and 2) of vector backbone results in a diagnostic PCR fragment 
of known size. To make sure that the amplified PCR product is not derived from 
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contaminating Agrobacterium cells that would still be present in the plant tissue, a PCR 
reaction was performed on each DNA with two primers specific for the chromosomal A. 
tumefaciens gene picA (Yusibov et al. 1 994). 

The series 1 A. thaliana plants contains 1 8 transgenic plants co-transformed with the K 
5 and the Hsb T-DNA vectors. The K vector contains borders in the natural octopine 
context with inner and outer border regions of 150 bp and 255 (RB) - 300 (LB) bp, 
respectively, whereas the Hsb vector contains only the outer border regions. 
Screening for the integration of vector backbone sequences was revealed that vector 
sequences from the K and Hsb T-DNAs were found in 33% (6/18) and in 61% (11/18) 

10 of the transgenic plants, respectively. Only in one transformant the vector sequences 
were not linked to the T-DNA. No major differences were detected in frequencies of 
vector joined to the RB region of the Hsb (6/18) and the RB (4/18) region of the K 
plasmid (see Tablel). However, much more vector was integrated at the LB region of 
the Hsb T-DNA (11/18) than of the K T-DNA. In all cases, in which an integrated T- 

1 5 DNA contains vector sequences at the RB, also vector sequences at the LB could be 
detected. 

The series 2 tobacco plants contains 36 transgenic plants transformed with a Ksb 
vector. This vector contains also RB and LB without inner border regions and is 
therefore comparable to the Hsb vector. In 53% (19/36) of the analyzed transformants 
20 (see Table 2) vector sequences were found; these were linked to the LB (8/19) only, to 
the right T-DNA end (1/19) only, or to both ends (10/19). 

In another series of tobacco transformants containing the Ksb T-DNA (data not shown), 
81% of the transformants (21/26) contained vector sequences linked at the LB and 
61% (16/26) linked at the RB (data not shown). Strikingly however, of these 16 
25 transformants containing RB backbone sequences, 1 5 of them contain also integrated 
LB vector sequences (data not shown). 

Taken together, in the three different series of transformants in which a T-DNA vector 
was used without inner border sequences, more than 50% of the transformants 
contained vector backbone DNA. Very few transformants contained vector DNA linked 

30 only to the right T-DNA end. In general, when vector DNA linked to the right border was 
present, also vector DNA linked at the left T-DNA end could be detected. This implies 
that especially read-through at the left border repeat, due to inefficient nicking is 
responsible for the integration of vector backbone sequences. It is possible that the 
deletion of the inner border region, a piece of T-DNA present in the original Ti plasmid 

35 form which the vector was derived, causes inefficient nicking of the LB repeat, which 
results in read-through past the LB and the transfer of downstream-located vector 
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sequences. Consistently, Horsch and Klee (1986) observed that the overal rate of 
transfer to plants of plasmids containing only the 25 bp repeat is not as high as with the 
larger Ti-derived border fragments, suggesting that additional sequences surrounding 
the border repeats play some role in the transfer. These authors have, however, not 
5 addressed the question of possible integration of vector backbone sequences in 
transgenic plants using said vector. 

Example 3 

Integration of vector backbone sequences assessed by DNA gel blotting 

10 DNA gel blot analysis was performed essentially according to Maniatis et al. (1982) on 
approximately 0.6 pg genomic DNA. The restriction sites used (Kpnl-Sacl) are 
indicated in Figure 5A. As probe, the vector sequence of the K plasmid was used after 
restriction with Kpnl and Sacl. The non-radioactive "Gene images" random prime 
labelling module and the "Gene images" CDP Star detection module (Amersham, 

1 5 Aylesburg, UK) were used for the hybridization and the detection, respectively. 

As outlined in Example 2, linkage of both RB and LB regions to at least a 1000 bp 
vector backbone fragment of one or both plasmids was observed in different 
transformants of series 1 (see Table 1). To determine whether the complete binary T- 
DNA plasmid (T-DNA + vector backbone sequences) is integrated into the genome of 

20 these transformants, a DNA gel blot analysis was performed with the K vector 
sequence as probe. When the whole vector sequence is present, restriction of the 
genomic DNA with Kpnl and Sacl should result in the detection of a fragment of 
approximately 8467 bp for K (arrow 1 in Figure 5B) and of 7066 bp for Hsb (arrow 2 in 
Fig. 5B). The DNA gel blot analysis given in Figure 3B shows that in 10 of the 11 

25 analyzed plants, the whole vector backbone sequence backbone was integrated into 
the plant genome. Samples in lanes 1, 2 and 5 are derived from plants from 
experiments other than those described. These results suggest that complete vector 
sequences are frequently integrated into the plant genome. 

30 Example 4 

Construction of optimized T-DNA vectors with modified left borders. 

Exemplary optimized binary T-DNA vectors with modifications of the types indicated in 
Figure 2 were constructed as follows. 

In this example, all optimized binary T-DNA vectors described in this example are 
35 derived from pTHW136. The binary vector pTHW136 contains octopine-type right and 
left border core sequences forming an imperfect repeat, each flanked by octopine-type 
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border outer regions originating from pTi15955 (224 bp in the case of the RB outer 
region and 268 bp in the case of the LB outer region). On the T-DNA of pTHW136 are 
located a GUS-intron expression cassette (35S promoter-GUS open reading frame 
interrupted by an intron-35S terminator) and a nptll selectable marker gene (nos 
5 promoter-npf// open reading frame-ocs terminator). 

To create exemplary binary T-DNA vectors of the types indicated in Figure 2, the GUS- 
intron expression cassette was removed from pTHW136 by digestion with Xba\ and 
H/ndlll, followed by filling in the overhangs with the Klenow polymerase and by 
religation of the resulting blunt ends. The vector obtained by this procedure is further 

1 0 denominated as p01 30. 

In a following step, the nptll selectable marker gene was removed from p0130 by 
digestion with BamHI and subsequent religation of the compatible vector overhangs. 
This procedure yielded a binary T-DNA vector denominated as pCDVIB. The vector 
pCDVIB thus contains an octopine-type LB region spanning a core sequence flanked 

15 by a border outer region and thus is a construct of the B-type as indicated in Figure 2. 
To create a construct of the E-type indicated in Figure 2 t two long oligonucleotides, 
prmCDVIBIF and prmCDVIBIR, were designed with the following sequences: 
-prmCDVIBIF: 

5'(1 ^CATGGAGCGGC GGCAGGATATATTCAATTGTAA ATGGCTAGCGGC GGCAGG 
20 A TA TA TTCAA TTG TA4 ATGGCTG(84)3 , (SEQ ID NO 5); partial A/col-site in bold and 
double underlined, left border core sequences in italics and single underlined, 3* end G 
residue part of a BamHI-site; and 
-prmCDVIBIR 

5'(1 ^G^CC AGCC AT TTACAATTGAATATATCCTGCC GCCGCT AGCC AT TTACAATT 
25 GAATATATCCTGCCG CCGCTC{%4)3 (SEQ ID NO 6); partial SamHI-site in bold and 
double underlined, left border core sequences in italics and single underlined, 3' end C 
residue part of a A/col-site. 

The oligonucleotides prmCDVIBIF and prmCDVIBIR are complementary over a 
distance of 80 nucleotides and yield, after annealing, a dsDNA fragment with at its 5* 

30 (relative to prmCDVIBIF) a 5' Nco\ overhang ('CATG') and at its 3' (relative to 
prmCDVIBIF) a 5' SamHI overhang ('GATC'). Said dsDNA fragment furthermore 
contains a tandem of two octopine-type LB core sequences corresponding to 
nucleotides 12-33 and 46-67 of prmCDVIBIF. Both LB core sequences are extended 
upstream and downstream by a further 6 basepairs originating from the natural LB of 

35 pTil 5955 and identical as in pTiAch5 (Gielen et al. 1984). The repeats in the tandem of 
LB core sequences are thus separated by 12 bp. Said dsDNA fragment with the 



0144482A2 1 > 



WO 01/44482 



PCT/EP00/12683 



tandemly arranged LB core sequences was inserted upstream of and adjacent to the 
LB core sequence of p0130 digested with Ncol and BamHI, yielding pCDVIBt. 
pCDVIBI thus contains an octopine-type left border region spanning a LB outer region 
and three tandemly arranged LB core sequences. 
5 To create a construct of the A-type indicated in Figure 2, two long oligonucleotides, 
prmCDVIB2F and prmCDVIB2R, were designed with the following sequences: 
-prmCDVIB2F: 

fi'M^CATGGCCGGGAAATCTACATGGATCAGCAATGAGTATGATGGTCAATATGGA 
G AAAAAGAAAGAGTAATTACCAA I I I I I I I I CAATTCAAAAATGTAGATGTCCG(1 1 6) 
10 3' (SEQ ID NO 7); partial A/col-site in bold and double underlined, 3' end G residue part 
of a BamHI-site; and 
-pCDVIB2R: 

5'( 1 ^GATCCGGACATCTACATTTTTGAATTGAAAAAAAATTGGTAATTACTCTTTCT^ 
TTTCTCCATATTGACCATCATACTCATTGCTG ATCCATGTAGATTTCCCGGC(1 1 6)3' 
15 (SEQ ID NO 8); partial BamHI-site in bold and double underlined, 3' end C residue part 
of a A/col-site. 

The oligonucleotides prmCDVIB2F and prmCDVIB2R are complementary over a 
distance of 112 nucleotides and yield, after annealing, a dsDNA fragment with at its 5* 
(relative to prmCDVIBIF) a 5' Ncol overhang ('CATG') and at its 3* (relative to 

20 prmCDVIBIF) a 5' BamHI overhang ('GATC'). Said dsDNA fragment furthermore 
contains a 112 bp inner region of pTi15955. This fragment was inserted upstream of 
and adjacent to the LB core sequence of p0130 digested with Ncol and BamHI, 
yielding pCDVIB2. PCDVIB2 thus contains an octopine-type LB region spanning a LB 
core sequence embedded in LB outer and inner regions. 

25 Another T-DNA vector construct was made with a modified LB combining features of E- 
and F-type constructs indicated in Figure 2. An integral nopaline-type LB region was 
obtained as a 331 bp Sc/l-EcoRI fragment of binary T-DNA vector pPZP200. This 
fragment spans nopaline-type LB outer region, LB core sequence and LB inner region 
originating from pTiC58 (Gielen et al. 1999). Said nopaline-type LB region was inserted 

30 upstream of and adjacent to the octopine-type LB region of pCDVIB2 (containing 
tandemly repeated octopine-type LB core sequences) digested with BamHI (located at 
the 3* of the LB core sequence tandem, relative to prmCDVIB2F) and EcoRI, yielding 
pCDVIB3. PCDVIB3 thus contains a LB region spanning a copy of an integral octopine- 
type LB region with three tandemly arranged core sequences and a copy of an integral 

35 nopaline-type LB region. 
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All described exemplary T-DNA vectors with a modified LB region, i.e. pCDVIB, 
pCDVIBI , pCDVIB2 and pCDVIB3 serve as a starting point to insert gene(s) of interest 
and/or selectable marker gene(s) in between RB and LB of said T-DNA vectors. 
The neomycin phosphotransferase (nptll) selectable marker gene under control of the 
5 nopaline synthase (nos) promoter {Pnos-nptll-3'ocs) and the ^-glucuronidase (gas) 
expression cassette under control of the 35S cauliflower mosaic virus promoter (P35S- 
gus-3'nos) were inserted between the RB and LB of said T-DNA vectors. 
The nptll-gus cassette was derived from the pXD610 plasmid (De Loose et a/., 1995) 
as an EcoR\-Age\ fragment. This fragment was inserted into the SaMEcofU sites of 
10 pCDVIB; pCDVIBI; pCDVIB2 and pCDVIB3. To obtain compatible sticky ends, two 
adaptor oligonucleotides between the open Age\- and Sa/l-site were used with the 
following sequences: 

* Napod3: 

CCGGTGGCTCGAGG (SEQ ID NO 9); partial >Agel-site in bold, 3' end G residue part 
15 of a Sa/l-site 
and 

* Napod4: 

TCG AC CTCG AG CCA (SEQ ID NO 10); partial Sa/I underlined, 3' end A residue part of 
a Age\-$\\e 

20 The oligonucleotides Napod3 and Napod4 are complementary over a distance of 8 
nucleotides and yield, after annealing, a dsDNA fragment with at its 5' (relative to 
Napod3) a 5* Age\ overhang ('CCGG') and at its 3* (relative to Napod3) a Sa/I overhang 
(TCGA*). The obtained vectors were called pCDVIB+gusnpt, pCDVIBI +gusnpt, 
pCDVIB2+gusnpt, pCDVIB3+gusnpt. 

25 

Example 5 

Construction of optimized T-DNA vectors with integrated recombination sites. 
Exemplary optimized binary T-DNA vectors of the types indicated in Figure 3 are 
constructed as follows. In this example, the optimized binary T-DNA vector is derived 
30 from pCDVIB (see Example 4). 

One strategy to introduce recombination sites comprises: 

1 . Designing pairs of primers for PGR amplification of vector domains upstream of 
the RB and downstream of the LB. Said domains are to encompass a unique 
restriction site upstream of the RB (e.g. the A/art-site in the pVS1 replicon 
35 contained within the pCDVIB vector backbone) and downstream of the LB (e.g. 

the Bc/l-site in the Sm/Sp resistance marker contained within the pCDVIB vector 
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backbone). The PCR primers located at or close to the RB or LB are designed 
such that they contain a 5' unique restriction site followed by the recombination 
site (e.g. the FRT site) in its right orientation (as direct repeats outside the T- 
DNA borders in case of FRT sites) and followed by part of the border outer 
5 regions. The obtained PCR products are subsequently digested with the 

appropriate enzymes. 

2. Designing complementary oligonucleotide pairs (similar as in Example 4) after 
annealing yielding dsDNA molecules spanning either the RB or LB core 
sequence and with one overhang complementary to the 5' unique restriction site 

10 of said primers in (1) located at or close to the borders and with a second 

overhang complementary to a unique restriction site downstream of the RB core 
sequence (e.g. Seal in pCDVIB) or complementary to a unique restriction site 
upstream of the LB core sequence (e.g. Nco\ in pCDVIB). 

3. (a) Digesting pCDVIB with Nart (in the pVS replicon) and Seal (downstream of 
15 RB core sequence) followed by a trimolecular ligation of 1) the resulting vector 

with 2) the digested PCR product encompassing the deleted pCDVIB fragment 
and introducing the recombination sites and 3) the dsDNA based on the 
annealed oligonucleotides and reintroducing the RB core sequence, 
(b) Digesting the vector finally obtained in (3a) with Bc/I (in Sm/Sp resistance 
20 marker) and Ncol (upstream of LB core sequence) followed by a trimolecular 

ligation of 1 ) the resulting vector with 2) the digested PCR product encompassing 
the deleted pCDVIB fragment and introducing the recombination sites and 3) the 
dsDNA based on the annealed oligonucleotides and reintroducing the LB core 
sequence. 

25 The described exemplary T-DNA vectors with introduced recombination sites serves as 
a starting point to insert gene(s) of interest and/or selectable marker gene(s) in 
between RB and LB of said T-DNA vector. 



Example 6 

30 Prediction of vector backbone transfer frequencies using induced 
Agrobacterium cultures. 

Agrobacterium strains containing any T-DNA vector can be induced for production of 
ssT-DNA strands by adding acetosyringone at a concentration of 100 pM to the culture 
medium (Durrenberger et al. 1989). The production of ssT-DNA strands can be 
35 analyzed using either one of the following methods (see included figure for schematic 
overview). 
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1 . Total DN A is isolated from the Agrobacterium cells containing a given T-DNA 
vector. Duplicate non-denaturing Southern blots (Tinland et al. 1995) of about 5 
pg of this DNA isolated from the different Agrobacterium strains are hybridized to 
two different probes: one recognizing a DNA sequence that is part of the T-DNA 

5 and one recognizing a DNA sequence that is part of the vector backbone 

flanking the left border. The hybridization signal of the probe recognizing the 
vector backbone sequences is absent or much reduced in the case of a T-DNA 
vector from which the left border is processed with high efficiency. The 
hybridization signal of the probe recognizing the T-DNA is, however, be 
10 constant. 

2. Total DNA is isolated from the Agrobacterium cells containing a given T-DNA 
vector. Two separate digests containing equal amounts of total DNA are set up, 
one with a restriction enzyme cutting the T-DNA at or very near to the left 
border, the other one with a restriction enzyme cutting within the T-DNA 

15 sequence. The ssT-DNA strands will, however, remain intact. Specific primer 

pairs are used in a quantitative PGR to amplify a T-DNA-borne sequence and a 
vector-borne sequence downstream of the left border. Again, no or much less 
vector backbone amplification product is obtained in the case of a T-DNA vector 
from which the left border is processed with high efficiency. The amounts of T- 

20 DNA-specific amplification product is, however, constant. 

For both types of analyses, an additional GFP (green fluorescent protein) expression 
cassette is cloned immediately downstream of the left border of the different T-DNA 
vectors. This cassette provides a fully known Vector backbone' sequence downstream 

25 of the left border. The GUS expression cassette is used as T-DNA-specific target. Both 
expression cassettes contain an intron in the GFP- and GUS-coding regions, 
respectively, to prevent expression of both markers in Agrobacterium. 
In Figure 6 both experimental setups are schematically drawn. Above the solid black 
line indicating the T-DNA vector, the hybridization approach is shown and below the 

30 solid line, the quantitative PCR approach is shown. 

Example 7 

Early analysis of vector backbone transfer sequences to plant cells. 

The constructs outlined in Example 6 and containing an additional GFP expression 
35 cassette downstream of the left border are used in a transient expression assay for 
assessment of vector backbone transfer to plant cells. To this end, Arabidopsis root 
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fragments are transformed with Agrobacterium (De Buck et al. 1999, Valvekens et al. 
1988) containing a T-DNA vector modified according to the invention. After 3 days of 
co-cultivation, roots are assessed first for transient expression of the GFP followed by 
assessment of transiently expressed GUS. The level of GFP expression is low or zero 
5 in the case of a T-DNA vector from which the left border is processed with high 
efficiency. The levels of GUS activities are, however, constant. 

Alternatively, the procedure of reverse transcription followed by PCR is followed for 
detecting and quantitating the levels of transiently accumulated GFP- and GUS- 
transcripts (Narasimhulu et al. 1996). Again, no or much less GFP-specific 
1 0 amplification product is obtained in the case of a T-DNA vector from which the left 
border is processed with high efficiency. The amounts of GUS-specific amplification 
product is, however, constant. 

Example 8 

1 5 Curing transformed plants of integrated vector backbone sequences. 

Plants (e.g. Arabidopsis) are transformed (see Example 1 ) with the exemplary T-DNA 
vector of Example 5 containing the FRT recombination sites. Regenerated plants are 
tested for vector backbone integration according to any of the methods described in 
Examples 2-3 or 6-7 and transgenic plants containing the T-DNA and vector backbone 

20 in their genome are selected. 

At flowering, one set of selected transgenic plants are cross-pollinated with pollen 
derived from another transgenic plant constitutively expressing the FLP site-specific 
recombinase while a second set of selected transgenic plants are cross-pollinated with 
pollen derived from a wild-type plant. Seeds are collected and sown on medium 

25 containing a selective agent. Surviving plants (due to the selectable marker in the T- 
DNA) of both crossings are analyzed again for the presence of T-DNA and for the 
presence of vector backbone. In a substantial part of the progeny of the crossings 
involving the FLP-expressing parent and containing the T-DNA, the vector backbone 
sequences are removed. In the progeny of the crossings involving the wild-type parent 

30 and containing the T-DNA, the vector backbone sequences are still present. 

Example 9 

Cloning of the virD locus and influence of an extra copy of the locus on vector 
backbone transfer. 

35 The sequences of the octopine-type and nopaline-type virD loci are known (Yanofsky 
et al. 1986, Jayaswal et al. 1987; and Wang et al. 1990, respectively). Specific primers 
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are designed enabling PCR amplification of the complete virD loci. These loci are 
subsequently subcloned in a plasmid carrying the P15a replicon (Chang and Cohen 
1 987) and a tetracyclin resistance marker derived from pAlter (Promega). The plasmids 
are subsequently mobilized to Agrobacterium containing a T-DNA vector. Transfer o1 
5 vector backbone sequences by said Agrobacterium strains is analyzed according to 
any of the methods described in Examples 2, 3, 5 or 6. 

Example 10 

Determination of the frequency of initiation of transfer at the left border 

10 Plants are transformed with the T-DNA vectors described in Figure 2. Total DNA is 
prepared and three primer sets are synthesized to perform different PCR reactions. 
The first primer set amplifies an internal T-DNA fragment, adjacent to the left border 
repeat (PCR fragment t). The second primer set amplifies a T-DNA/vector fragment 
spanning the T-DNA left border region and the vector region adjacent to the left border 

15 repeat (PCR fragment 2). Finally, the third primer set amplifies a vector fragment 
adjacent to the left border repeat (PCR fragment 3). Dependent on whether the vector 
transfer results from readthrough at the left border or initiation at the left border, 
different PCR fragments are amplified. When only PCR fragment 1 is present, the T- 
DNA transfer started at the right border and stopped at the left border and no vector 

20 DNA was transferred. When PCR fragments 1 ,2 and 3 are present, the T-DNA transfer 
started at the right border but didn't stop at the left border. When PCR fragments 1 and 
3 are present but not fragment 2, the vector transfer started at the left border, 
independently of T-DNA transfer from the right border. Performing these different PCR 
reactions for approximately 50 transformants thus yields the frequency of readthrough 

25 at the left border and the frequency of initiation at the left border repeat. The frequency 
of readthrough and initiation of vector transfer is determined for the different T-DNA 
vectors with the left border repeat in different contexts. 



30 
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Table 1. Presence (+) or absence (-) of vector sequences linked to the LB and RB of 
the T-DNA in the transgenic A. thaliana plants co-transformed with both K and Hsb T- 
DNA vectors. 



Number of co- 
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10 



a Indicates the number of transformants with a particular pattern of vector 
sequences present in the transformed plants. 

b Right (R) or left (L) external border repeat fragments are present of either of 
the T-DNA vectors, not linked to either of the borders. 
c Not tested 
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Table 2. Presence (+) or absence (-) of vector sequences linked to the LB and RB of 
the T-DNA in the transgenic N. tabacum plants transformed with the Ksb T-DNA vector. 



Number of co- 
transformants 3 


LB100 LB1000 RB100 RB1000 
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5 a Indicates the number of transformants with a particular pattern of vector 
sequences present in the transformed plants. 

b Right (R) or left (L) external border repeat fragments are present of either of 
the T-DNA vectors, not linked to either of the borders. 
0 Not tested 
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Claims 

1 . A T-DNA transformation vector comprising T-DNA with flanking T-DNA borders, 
characterized in that said T-DNA borders are modified resulting in preventing the 

5 integration of vector backbone sequences in the transformed cells and/or in curing 
transformed cells of integrated vector backbone sequences. 

2. The vector according to claim 1 wherein said modifications include a modification of 
the right T-DNA border comprising a single right border core sequence flanked by a 
right border outer region and/or a modification, including multiplication, of the left T- 

1 0 DNA border or part thereof. 

3. The vector according to claim 2 wherein said modification of said left T-DNA border 
results in efficient nicking at the left border core repeat by the nicking complex at least 
involving the VirD1 and/or VirD2 proteins. 

4. The vector according to any claims 1 to 3 wherein said modified left T-DNA border 
15 comprises a single left border core sequence that is flanked by a natural left border 

outer region and an intra-T-DNA left border proximal region with a preferable length of 
10 to 100 bp, preferably of 20-100 bp and which is enriched in the number of A- and T- 
residues, the percentage of AT-residues being 65 to 80%. 

5. The vector according to any of claims 1 to 3 wherein said modified left T-DNA 
20 border comprises a single left border core sequence that is flanked only by a natural 

left border inner region. 

6. The vector according to any of claims 1 to 3 wherein said modified left T-DNA 
border comprises only a single left border core sequence. 

7. The vector according to any of claims 1 to 3 wherein said modified left T-DNA 
25 border comprises the inclusion of at least two left border core sequences separated by 

a sequence of at least 10-20 bp optionally carrying stop codons in the three reading 
frames and in both directions; and wherein said tandem repeat of the left border core 
sequences is flanked only by a left border outer region. 

8. The vector according to any of claims 1 to 3 wherein said modified left T-DNA 
30 border comprises the inclusion of at least one integral nopaline-type left border region 

adjacent to and downstream or upstream of the integral octopine-type left border 
region. 
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9. A T-DNA transformation vector according to any of claims 1 to 8, characterized in 
that said modifications of the T-DNA borders comprise the addition of recombination 
sites downstream of the left border core sequence and the addition of recombination 
sites upstream of the right border core sequence, said recombination sites being 

5 organized as repeats. 

10. The vector according to claim 9 further comprising a second copy of a left border 
region upstream of and preferably adjacent to the single right border outer region and 
said recombination site upstream of the core sequence of said second left border 
region. 

10 1 1 . A vector according to any of claims 9 or 10 further comprising a recombinase gene 
located downstream of said recombination site downstream of said left border core 
sequence, and preferably, when present, adjacent to and downstream of the left border 
outer region. 

12. A vector according to claim 11 comprising said recombinase gene flanked by 
1 5 repeats of recombination sites and said vector further comprising a second copy of a 

left border region located downstream of said recombinase gene flanked by said 
recombination sites. 

13. A vector according to claim 12 further comprising additional recombination sites 
organized as repeats downstream of the second left border core sequence and 

20 upstream of the single right border core sequence. 

14. ,The vector according to any of claims 9 to 13 characterized in that said 
recombination sites are located adjacent to and downstream and/or upstream of the 
left- and/or right border core sequences or are separated downstream and/or upstream 
from the left- and/or right border core sequences by a sequence of at least 10-20 bp in 

25 length optionally carrying stop codons in the three reading frames and in both 
directions. 

15. A vector according to any of claims 9 to 14 wherein said recombination sites 
organized as repeats are defined as either site-specific recombination sites organized 
as direct repeats or as transposon border sequences organized as inverted repeats; 

30 and wherein said recombinase gene is defined as either a site-specific recombinase or 
a transposase gene, respectively. 

16. A vector for Agrobacterium-med\a\edi transformation, for agrolistic transformation of 
for gene therapy purposes comprising at least one modified T-DNA border as defined 
in any of claims 1 to 15, wherein said vector is chosen from binary transformation 



BNSDOCID: <WO 0144482A2_I_> 



WO 01/44482 



50 



PCT/EP00/12683 



vectors, super-binary transformation vectors, co-integrate transformation vectors, Ri- 
derived transformation vectors and T-DNA carrying vectors used in agrolistic 
transformation or gene therapy. 

17. A method for obtaining transgenic plants, yeasts, moulds or filamentous fungi 
5 consisting of an Agrobacterium-mediaXed transformation of said plants, yeasts, moulds 

or filamentous fungi with a transformation vector according to any of claims 1 to 16. 

18. A method for obtaining transgenic plants, yeasts, moulds or filamentous fungi 
consisting of an Agrobacterium-mediaXed transformation of said plants, yeasts, moulds 
or filamentous fungi with a transformation vector defined in any of claims 9 or 10 in 

1 0 combination with the supply of a recombinase or transposase, for curing said resulting 
transformed cells from integrated vector backbone sequences possibly originating from 
said vector. 

1 9. A method for obtaining transgenic plants, yeasts, moulds or filamentous fungi 
consisting of an Agrobacterium-med\a\ed transformation of said plants, yeasts, moulds 

15 or filamentous fungi with a transformation vector defined in any of claims 1 1 to 13 for 
curing said resulting transformed cells from integrated vector backbone sequences 
possibly originating from said vector, optionally in combination with the supply of a 
recombinase or transposase. 

20. A method for preventing the integration of vector backbone sequences in an 
20 >4grobacfer/um-mediated transformed cell comprising enhancing the efficiency of the 

nicking at the left border core sequence of a T-DNA vector, including vectors according 
to any of claims 1-16, by increasing the production of the VirD1 and/or VirD2 proteins 
comprised within the T-DNA nicking complex. 

21. The method of claim 20 involving the integration of at least one additional copy of 
25 the virD locus into the genome or into an extrachromosomal entity of Agrobacterium. 

22. The method of claim 21 wherein said additional copy of the virD locus is selected 
from the octopine-type virD locus or the nopaline-type virD locus. 

23. The method according to any of claims 17 to 19 further comprising increasing the 
production of VirD1 and/or VirD2 proteins comprised within the T-DNA nicking 

30 complex, using any of the methods of claims 20 to 22. 

24. The method of agrolistic-based transformation of a eukaryotic cell using a T-DNA 
vector modified according to the alterations as defined in the vectors of any of claims 1 
to 16. 
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25. A gene therapy method comprising any use of a T-DNA vector modified according 
to the alterations as defined in the vectors of any of claims 1 to 16. 

26. A recombinant host cell containing any of the vectors as defined in any of claims 1 
to 16. 

5 27. The host cell of claim 26 identified as being an Agrobacterium tumefaciens. 

28. A transgenic plant cell or plant obtainable by an Agrobacterium-med\a\e6 
transformation method according to any of claims 17 to 24, part thereof, or progeny 
thereof. 

29. A transgenic yeast, mould or filamentous fungus obtainable by an Agrobacterium- 
10 mediated transformation method according to any of claims 17 to 24. 
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